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Abstract 

This technical report presents a method for designing a constrained output-feedback model pre- 
dictive controller (MPC) that behaves in the same way as an existing baseline stabilising linear time 
invariant output-feedback controller when constraints are inactive. The baseline controller is cast into 
an observer-compensator form and an inverse-optimal cost function is used as the basis of the MPC 
controller. The available degrees of design freedom are explored, and some guidelines provided for 
the selection of an appropriate observer-compensator realisation that will best allow exploitation of the 
constraint-handling and redundancy management capabilities of MPC. Consideration is given to output 
setpoint tracking, and the method is demonstrated with three different multivariable plants of varying 
complexity. 



A paper based on the work presented in this technical report has been submitted to IEEE Transactions on Automatic Control 
on 12/09/2011 as a regular paper under the title "Designing output-feedback predictive controllers by reverse-engineering 
existing LTI controllers". 
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1 Introduction 



In many cases, a conventional linear time-invariant (LTI) controller already exists for a given application 
where a system designer would consider the use of model predictive control (MPC) to improve performance. 
The ability to handle input and output constraints in a systematic manner is one of the main reasons that 
motivates the use of MPC ( |Maciejowski||2002t[Camacho & Bordons|[2004t|Rawlings & Mayne||2009^ 
and the keystone of its industrial success ( |Qin & Badgwell[[2003) , allowing plants to safely operate more 
closely to the boundaries of their feasible operating regions. 

Whilst the definition of the constraints is usually obvious for a given application, motivated by the physical 
limitations of the plant and performance requirements on the controlled variables, encoding the remaining 
control objectives into the cost function is often unintuitive, especially for a highly cross-coupled MIMO 
plant in the presence of unmeasured disturbances. 

When full state measurements or estimates exist, an inverse-optimal control problem can be constructed 
to obtain a cost function for which the (unconstrained) optimum solution is equivalent to a prescribed state 
feedback gain ( |Kalman[[l964f . It was noted by ( jKreindler & Jameson] |1 972} that a quadratic cost function 
including cross-terms between state and input values could reproduce any multivariable state feedback 
gain, by making the primary control objective to reproduce the state feedback gain. If a state feedback 
gain Kc is within the domain of gains that can be obtained by solving the infinite horizon LQR problem, a 
linear matrix inequality (LMI) problem (IBoyd et aL] |1 994| can be posed to find quadratic cost weightings 
O > and f? > such that the infinite-horizon LQR controller is Kq. in ( |Di-Cairano & Bemporad||2009| 
Pi Cairano & Bemporacfj|2010) an LMI-based method allowing quadratic cost weights to vary throughout a 
finite prediction horizon is proposed allowing reproduction of a wider range of static gains when operating 
in the controller's unconstrained regime and that the behaviour of a dynamic feedback controller can also 
be reproduced by including the original controller dynamics within the plant model. 



In reality, the full state measurements are not always directly available, necessitating further work for 
the system designer: the design of a state observer to obtain an estimate of the state. The observer 
introduces additional dynamics that change the closed-loop behaviour. Moreover, it is known that a "fast" 
observer and a "good" feedback gain do not necessarily combine to give good closed-loop performance 
( |Doyle|[T978t [Doyle & Stein||1979) , so this process is nontrivial. 



To capitalise on the effort already expended during the design of the existing unconstrained LTI output- 
feedback controller, it would be desirable to be able to construct an initial MPC design and state observer, 
which, when used in combination, replicate the unconstrained behaviour of the original controller. 
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In ( |Rowe & Maciejowsklj|1999|[2000) a method was proposed for obtaining an MPC controller with the 
same unconstrained behaviour as an Hoo controller obtained through the loop-shaping procedure of 
(M cFarlane & Glover] |1992[ ), thus inheriting its desirable performance and robustness properties when 
constraints are inactive. This paper instead describes a method in which an observer-compensator 
realisation of an arbitrary stabilising LTI output-feedback controller can be used to obtain the state 
observer, and cost function for the MPC controller. 



The design method presented here relies upon the results of (sBender & Fowell[ [1 985t |Bender| |1 985 
Fowell et an|198 6^, further developed in (lAlaza rd & Apkarian|[T999;,Delmond et al.|[2006} , which show 
an analytical method for obtaining an observer-based realisation of an arbitrary linear time-invariant, 
stabilising, output feedback controller, and the methodology is motivated by the cross standard form (CSF), 
introduced in ( |Alazard[[2002[ ). The CSF is an inverse-optimal generalised plant model with the optimal 
"^2 and Hoc controllers both equal to the observer-based realisations of a pre-specified output feedback 
controller Kq of order greater than or equal to the order of the plant. A discrete-time variant of the CSF of 
dAlazardl |2002) is defined in ( [Voinot et ar||2003) , whilst in ( jPelmond et al.||2006^ the continuous-time 
CSF is generalised to accommodate the case where the baseline controller is of lower order than the 
controlled plant. 



It was proposed in ( [MaciejowskH |2007| that a similar method can also be used as the starting point 
for a model predictive controller that, by construction, will exhibit the same behaviour as the original 
feedback controller in the absence of active constraints. This paper builds upon the work of (| Maciejowsklj 
2007t [Hartley & MaciejowskH |2009t [Joosten & Maciejowski] |2009t [Hartleyl |2010 ) by further addressing 
the effects of the choices of the non-unique realisation of the original controller in the context of the 
resulting constrained predictive controller. Furthermore, a method is proposed, with which reference- 
tracking controllers can also be reverse-engineered and cast into the MPC framework. The introduction of 
unwanted cross-coupling between seemingly unrelated control loops is also explained, and methods for 
avoiding this are proposed. 



Three case studies are presented demonstrating the effectiveness of the procedure: 



1. Attitude control of a satellite with redundant torque pairs 



2. Control of an inverted pendulum and 



3. 



Control of a large airliner 
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2 Observer-based controller realisations 

MPC controllers are implemented on digital computers and usually operate in discrete time. Also, the 
original controller could be of lower order than the plant model (for example, a proportional-integral 
controller), so a discrete-time variant of the method of pelmond et al.[[2006| for low order controllers is 
used to obtain the observer-compensator realisation. The observer will be implemented directly, and the 
state feedback gain Kc used as the basis for an MPC controller to which constraint handling will be added 
(Figure [1). 



G(z) 



K{z) 



G{z) 



K, 



G(z) 



MPC 


X 


Gobs(^) 


< 



(a) Original system 



(b) Observer based realisation 



(c) IVIPC controller 



Figure 1 : Schematic representation of the reverse-engineering procedure 

The principles for obtaining the observer-based realisation are now presented in sufficient detail to 
motivate the decisions made to obtain a satisfactory constrained controller. Refer to the appendix for 
proofs. Consider a linear discrete-time state-space plant model G(z) of order n with ny outputs and 
inputs, with pair {C,A) observable, and an existing stabilising LTI controller Kq{z) expressed as its minimal 
realisation with order hk < n, 



G(z) 



A 


B ' 


C 






, Ko{z) 



Ak 


Bk ' 


Ck 


Dk _ 



There are two main structures for discrete-time observers of the plant G(z) (Alazard & Apkarian 1999 



Teixeira[[2008^ . These differ in whether the current measurements affect the current state estimate or not. 



Definition 1 (Fiiter form observer) A filter structure discrete time observer provides an a posteriori 
filtered estimate of thie current piant state, and takes thie form 



x{k+^\k) = {A- AKfC)x{k\k - 1 ) + Bu{k) + AKfy{k) 
x{k\k) = {!- KfC)x{k\k - 1) + Kfy{k) 



where Kf is an appropriateiy cfiosen observer gain matrix and x{k\k) is used to compute u{k) . 
Remari< 1 Measurements from thie current time step contribute to thie estimate of thie current state. Thiis 
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is useful if thie sampling period is significantly longer than the time required to compute the control action 
(i.e. can be modelled as a direct feed-through). 

Definition 2 (Predictor-form observer) The predictor structure provides an a priori prediction of the 
plant state by using the output measurements from the previous time step and takes the form 

x{k+^\k) = {A- KfC)x{k\k - 1 ) + etv(/c) + Kfy{k) 

where Kf is an appropriately chosen observer gain matrix such that {A - KfC) is stable. 

Remaric 2 The state estimate x{k\k - ^) is used for control purposes — i.e. to compute u{k) — at each 
time step. By using the estimate of x at time k given measurements from time k - ^ as the boundary 
condition for the beginning of the optimised trajectory, the optimisation associated with the MPC controller 
can commence before time k, allowing a time period equal to the sampling period for the computation to 
be performed. 

Remari< 3 A controller with a non-zero direct feedthrough term Dk (i.e. a controller that is not strictly 
proper) cannot be directly reproduced using the estimate from a discrete-time predictor. Techniques to 
avoid this limitation will be described in Section \4~^ In this section, assume that some transformation on 
(or modification to) the system has already been performed to ensure that the controller is strictly proper, 
yielding 



G(z) = 



A 


~B ' 


C 






Ko{z) = 













Theorem 1 For a filter- form observer-based controller, /<obs(^) = 
implies that Kobs{0) = 0. 

Proof Consider tlie following observer-based controller: 

^obs('^) = 



Ao 


Bo ' 


Co 


Do 



, Do = CoAo^Bo, which 



{A+ BKc){l - KfC) 


{A+BKc)Kf ' 


Kail -KfC) 


KcKf 



(1) 



Noting that /\o = (' - KfC)-'^{A + BKc)'^ it is clear that Do = CgAg^ Bo, which implies that Do + 



Co{zl - Ao)^ Bo = 0, when z = 0. ■ 

Remarl< 4 For the observer-based realisation to be a realisation of a controller Ko{z), Ko{z) must have 
this same property. Therefore through term Dk can only be correctly reproduced in discrete-time filter 



observer-based form if Dk = CkA^^^ Bk jlBender& Fowell 1985 ). 
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Given a matrix T witli full row rank, let the notation 7^^ indicate a right inverse of T, indicate the 
Moore-Penrose pseudo-inverse of T, and 7-*- indicate a matrix whose columns form an orthonormal basis 
for the nullspace of 7. 

Lemma 1 Given a matrix Kc = D^C + CkT and a matrix Kf such thatAKf = PBk - BD^, and assuming 
Dk = CkA]^^ Bk, T is of full row-rank and - T(A + BDkC) - TBCk T+BkC + AkT = 0, then it holds that 
KcKf = Dk- 

Proof 

KcKf = {DkC+CkT)A-\PBk - BDk) 

= {CkA-^' BkC + CkT)A-\PBk - BCkA~^' Bk) 
= Ck{A-^'BkC+T)A-\P - BCkA-^')Bk 

= CkA-^\BkC + AkT)A-\P - BCkA-^')Bk. (2) 
By rearrangement and factorisation, 

TA=- TBDkC - TBCk T+BkC + AkT 
= {I-TBCkA-^'){BkC + AkT) 
:. BkC + AkT = {I-TBCkA]^^)~^TA. (3) 

Therefore: 

KoKf = CkA-\I - TBCkA-^Y' TAA-\P - BCkA-^')Bk 
= CkA-\I-TBCkA-^Y\I- TBCkA-^')Bk 
= CkA^^ Bk 

= Dk. (4) 



Theorem 2 Let Kq{z) be a stabilising linear-time-invariant controller for the plant G{z) where nK < n 
and Kb(0) = 0. Assume that there exists T e R^^xn Qffyjj ^qi/i/ rank satisfying the non-symmetric Riccati 
equation 

-T{A+BDkC)-TBCkT+BkC + AkT=Q (5) 

and that det(/\) ^ and det(/\K-) ^ 0, then Kq{z) can be realised in a filter observer-based form with 
observer gain Kf such that AKf = PBk - BDk, 3nd state-estimate feedback gain Kc = DkC + Ck 7. 
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Proof Consider the (non-minimal) realisation, 





' Ak 





Bk 


Koiz) = 


Aek 


Ae 


Be 




[ Ck 





Dk 



(6) 



where Aek g r("-"'<)^"'< and Ae g R{"-"K)x(n-nK) ^ ^(n-nK)xny ^^^q arbitrary values. Given that 

T 



T is of full row rank, the matrix 
co-ordinates of (Q) which gives 



where 



is invertible, with inverse 



-+ T-L 



■^obs 


Sobs 


Cobs 


^obs 



■^obs 



with 



and 



Sobs 
Cobs 



M^=A- AKfC + BKc - BKcKfC 



TAKf + TBKcKt 



Consider the change of 
(7) 

(8a) 
(8b) 



cr\f 



T^^AKf+ T^^BKcKf 



{Kc - KcKfC) {Kc - KcKfC) T' 



Dobs = KcKf = Dk. 
Let Kc = DkC+ CkT and AK, = PBk - BDk then 

7Mi = TA + TBDkC - BkC + TBCkT 



(8c) 

(8d) 
(8e) 



(8f) 



B 



•obs 



Bk 
T^'PB, 



{8g) 
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and 



■'obs 



{Kc-DkQT- {Kc-DkQT^ 
Ck . 



(8h) 



If Is) holds then TM^ = AkT and TM^ 7+ = Ak, and TM^ = 0. System (T) is related to by similarity 
transformation, and is equal to ^ with 



Aek= T^\A-T^BkC+BDkC+BDkC + BCkT-BDkC)T^ 

= T^\a + BDkC- P BCk)T^ + T^^ BCk 
Ae = T^^{A - AKfC + BKc - BKcKfQT^ 

= T^^A- PBkC- BDkC+ BCkT+ BDkC- BDkQT^ 

= T^\a + BDkC- PBkQT^ 
Be = T^^PBk 



(9a) 



(9b) 
(9c) 



which in turn is a (non-minimal) realisation of Ko(z). Therefore (Q) is a realisation of Ko(z). ■ 

Theorem 3 Let ko{z) be a strictly proper, stabilising linear-time-invariant controller for the plant G{z) 
where nK < n. Assume that there exists T g R"Kxn gf fuu j-q^ /-g^^/^ satisfying the non-symmetric Riccati 
equation 

-TA-T~BCkT+~BkC + AkT = 0. (10) 

Then Ko{z) can be realised in a discrete-time predictor observer-based form with the observer gain 
Kf = PBk and the state-estimate feedback gain Kc = CkT. 

Proof The unconstrained observer-based controller is of the form: 

A-KfC + BKc Kf 



^obs('^) = 



Consider a non-minimal realisation of Ko(z) with state vector 







(11) 



of the form 





' Ak 





Bk 


Ko{z) = 


Aek 


Ae 


Be 




[ Ck 









(12) 



where Aek g M^"-"^)^"^, Ae g rC-^^Ix and Be g RC-^^jx"/. Given that T is of full row rank, the 
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matrix 



which gives 



where 



is invertible, with inverse 



7+ T' 



Consider the similarity transformation on (1 1 



/^obs(^) = 



■^obs 


Sobs 


C'obs 






"*obs 



with Ml = {A - KfC + BKc), 



Bobs 
C'obs 



TKf 



By substituting Kc= CkT and Kf = PB^, 



M2T^+TBCk M2T^ 
T^^M^T^ + BCk T^^M3 



Bk 
n'PBK 

Ck 



with M2 = {TA- BkC), and M^ = (A - PBkC), 

Bobs 
C'obs 

If TA + TBCkT- BkC = AkT then it follows that 

{TA-BkC)T^ + TBCk = Ak. 

It also follows that 

{TA-~BkC)T^ = (AkT - TBCkT)T^ 
= 0. 



(13) 



(14a) 



(14b) 
(14c) 



(15a) 



(15b) 
(15c) 

(16) 



(17) 



The observer-based controller ( pM) with the specified Kc, Kf and conditions on T is related by similarity 
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transformation to a system that is identical to ([T2]) wlien: 



Be=T^^PBk (18a) 
Aek = T^\A- P'BkQT^ + ~BCk (18b) 
Ae=T^\A-P~BkC)T^. (18c) 



Remark 5 Equations ^ and |T0| ) can more conveniently be written in matrix form 



T I 



Ar 



= 



(19) 



wtiere Ac\ is the state update matrix for the original closed loop system, 



Ar\ = 



A + BDkC BCk 
BkC Ak 



or 



A BCk 
~BkC Ak 



(20) 



Wlien nK < n, the observer-based controller realisation has n- nK more closed loop modes than the 
original controller. These correspond to the eigenvalues, with corresponding eigenvectors in the null-space 
of T. Whilst the dynamics of the output of the observer-based controller do not depend on these, once 
Kc = CkT + DkC is replaced by a constrained MPC controller, this will no longer be the case. The value 
of Kf is non-unique because 7"^^ is non-unique. Matrix Ae is also non-unique for both of the described 
formulations. Theorem |4] shows how these extra poles that are introduced into the closed loop system 
are determined by 7"t and that they may be used to tune closed-loop constrained performance when 
constrained MPC is used instead of static gain Kc. 



Theorem 4 The n-nK additional modes in the observer error dynamics can be determined by the choice 
of a predictor-form observer gain for the plant 



A + BDkC 


t^'b' 


BkCT^ 






or 



A 


t^'b' 


~BkCT^ 






(21) 



Proof The n-nK additional modes in observer error dynamics are associated with an invariant subspace 
o\ A + BDkC - AKfC in the nullspace of T and are determined by the eigenvalues of Ae- Given that 
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AKf =PBk- BDk and that P = + T^X, 



T^Ua- PBkC + BDkC] t 



= T^ (^A-{P + T^X)BkC + BDkCJ T 
= T^\a + BDkQT^ - XBkCT^. 



(22) 



The matrix X is then treated as an observer gain, and can be designed by pole placement, or by Kalman 
filter methods on the system ([21). The proof for the predictor form is analogous. ■ 

There are multiple solutions to the non-symmetric Riccati equations, which can be obtained using invariant 
subspace methods |Laub|[T979) . 



Theorem 5 If the plant model G{z) has n states, and the original controller Kq{z) has n^ states, then 
(letting lm( • ) denote the image operator), given an n-dimensional invariant subspace, S c C""^"*^ of Ad, 



(23) 



the columns can be partitioned vertically so that 



Un 



U2 



(24) 



where e C"^" and Uz g C"*^^". T = UzU^^ is a solution to the non-symmetric Riccati equation ^ or 



Proof Because 



is an invariant subspace. 



A + BDkC 


BCk 










BkC 


Ak 




U2 




U2 



A. 



(25) 



Postmultiplying both sides by ^ yields 



A + BDkC 


BCk 




1 




1 


BkC 


Ak 




U2U;' 




U2U;' 



(26) 
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By defining 7 = L/2L/-1 ^ and premultiplying botli sides by 
non-symmetric Riccati equation 



T I 



T I 



, it follows that T is the solution to the 









1] 


A+BDkC 


BCk 




' 1' 




BkC 


Ak 




T 



= 0. 



(27) 



Theorem 6 The poles of the pure state feedback system, {A + BKc) with Kc calculated as in Theorem\^ or 
\^are equal to the eigenvalues corresponding to the eigenvectors which comprise the invariant subspace 
S = \m[Ul, UlY if T = U2U^\ 



(28) 



Proof By considering ([26) it can be seen that 

A + BDkC + BCkT = A + BKc= U^AU^^ . 

Therefore, a{A + BKc) = o-(A)- ■ 

Corollary 1 The remaining nK closed loop poles from the original system, along with the n- nK modes 
introduced if the observer is of higher order than the original controller therefore correspond to the 
observer error dynamics. 

Remark 6 A real solution T will not exist if the partition of closed-loop poles between pure state feedback 
and observer dynamics is not compatible with the controllability and observability properties of the original 
plant. Also, complex conjugate pole pairs should not be split (Bender & Fowell[ \ 198^\Alazard & Apkarian\ 



The resulting observer-based controller using Kc and Kf is a (non-minimal when nK < n) realisation of 
the original controller. The closed loop system using the observer based realisation of the controller 
will contain n- nK poles that did not exist in the original closed loop system. These dynamics can be 
assigned by the system designer through the choice of 7+ used to calculate Kf. 



3 Model predictive controller formulation 



At the heart of every MPC controller is a constrained optimisation problem, where the summation of a 
stage cost function of the plant input and state is optimised over a prediction horizon of length N, subject 
to input and state constraints ( |Maciejowski||2002| : rCamacho & Bordons[|2004|[Rawlings & Mayne[|2009) . 
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Let N be the length of the prediction horizon, i{x, u) be the "stage cost" at each time step, and F/v(x) 
be cost on the state at the end of the finite prediction horizon. Define X, Y, U and T to be the set of 
feasible state values, the set of feasible output values, the set of feasible input values, and the terminal 
constraint set respectively. A basic model predictive control formulation is outlined in Algorithm [1] using 
the shorthand notation x{k) to be the prediction of the state x k time steps into the future from the 
current state estimate or measurement, and u{k) analogously. For notational convenience, define: 



x= x(0)^ ••• x{Ny 



u= iv(0)^ ••• u{N-^y 



Algorithm 1 : Model predictive control 



while controller running do 

1 Sample state measurement or estimate x{t). 

2 Solve 



arg min Fn (x(A/)) + ^ i^W, u{k)) 



subject to plant dynamics 



x{k+^) = f{x{k),u{k)) 
y{k+^) = g{x{k),u{k)) 



and constraints 



x(0) 
x{k) 

y{k) 
u{k) 
x{N) 



= x{t) (Current state measurement/estimate) 



gX V/c g {0,..., a/- 1} 

gY V/c g {0,...,A/- 1} 

gU V/c g {0,...,A/- 1} 
G T. 



4 



3 



Apply t/(0) to plant. 
Wait sampling time Tg. 



end 
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For the case of a linear time invariant plant models, as used for the reverse-engineering: 

f{x{k),u{k)) = Ax{k) + Bu{k) 
g{x{k),u{k))=Cx{k). 



3.1 Zero-value cost-function 



The reverse engineering procedure hinges upon replacing the static gain Kc with a constrained MPC con- 
troller that is, when constraints are not active, equivalent to the estimated state feedback KcX obtained for 
the discrete-time observer-based controller. A zero-value infinite horizon cost function can be constructed 
to ensure that u{k) = Kcx{k) is the optimal solution by using a stage cost 



£{x{k),u{k)) = 



'x{k) 


T r 


u{k) 





KjRKa 

-RKc 



-KJR 
R 



x{k) 
u{k) 



(29) 



where f? > is a weighting matrix, determining the relative importance of matching each input to that 
provided by the original controller ( [Kreindler & James6rij|1972} . A standard MPC implementation performs 
an optimisation over a finite, but receding horizon. A finitely parameterised infinite horizon cost function 
can be obtained by using the candidate cost function over a finite horizon of length N and using the 



solution P to the discrete-time algebraic Riccati equation as a terminal quadratic cost weighting dRawlings 
& Muske[[l993t|Chmielewski & Manousiouthakis||1996} . 



Theorem 7 When using the stage cost ^2^, P = isa solution to the associated discrete time algebraic 
Riccati equation (DARE). 



(30) 



Proof The discrete time algebraic Riccati equation associated with stage cost (|29j is 

A'^PA -P - ( A'^PB - KIr] ( B^PB + rYU B^ PA - RK.] + KI RK. = 0. 



By construction, the optimal state feedback gain is, Kc = - [B^PB + R) ^ [B^PA - RKc), so 

A^PA -P+( A'^PB - KIr] Kr + KiRKr 



= 

A'^PA -P + A^PBKc = 
A^P{A + BKc) - P = 0. 



(31) 



By inspection, P = is a solution. 
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Remark 7 Because of this unsurprising result, no terminal cost need be added to the finite horizon 
optimisation. 

Remark 8 When input constraints are active, the MPC implementation is interpreted as "regulating" to 
the region of the state space where u{k) = KcX{k) is feasible. This is more intelligent than merely "clipping" 
the control actions on input saturation. When output constraints are active, the control objective is to avoid 
constraint violations whilst minimising deviations from the unconstrained control actions over the horizon. 



When there is input redundancy, if one control actuator fails or saturates, the controller should be capable 
of using other control inputs to achieve a similar control effect. Under nominal operating conditions 
retaining the behaviour of the original controller is also desirable. To meet these objectives, a possible 
quadratic cost function is 



for f?i > 0, and Oi » R. The first term, tries to achieve the control effect of the original controller, whilst 
the second term ensures that when feasible, the original control actuator configuration is used. This cost 



Remark 9 Despite the resemblance, the predictive nature of the MPC implementation means that there 
is an anticipatory aspect to the control decision rather than a best-effort attempt to deliver a particular 
control effect at the current time step as would occur with a pure actuator allocation algorithm. 



Alternative zero-value cost functions such as 



£{x{k), u{k)) = \\Bu{k) - BKaX{k)\\% + \\u{k) - KaX{k) 



function can alternatively be expressed in the form of ([29) as: 



£{x{k),u{k)) = 



K^{R^ + e^Oi B)Kc -Kj{R^ + B^Q^ B) 
-{R^ + B^Q^ B)Kc (Ri + S^Oi B) 



(32) 



l{x{k),u{k)) = \\R{u-KcX)h 
ox£{x{k),u{k)) = \\R{u-KcX)\l 



oo 



(33a) 
(33b) 



or a combination of the two, can also be used depending on the desired control objectives when the 
baseline controller does not satisfy constraints ( [Rao & Rawlings[|2000^ . 
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3.2 Other cost functions 

If there happens to be a valid O > and R >0, with S = such that Kc is the optimal infinite-horizon 
discrete-time state feedback minimising the DLQR cost function 

oo 

^x{kfQx{k) + u{kfR{u) + 2x{k)^Su{k) (34) 



then LMI-based inverse optimality methods poyd et arj|1994} can be used. Even if this is not the case 



the cross term S can be minimised using LMI-based methods (Algorithm [2). In the general case though, 
the cross-terms are not guaranteed to be driven to zero, however the resulting cost function can provide 
different constrained closed-loop behaviour. 

Algorithm 2: Minimising cross terms using an LMI 
For a fixed e > 0, and compatibly sized O, P, R, S minimise ||S||2 subject to: 

O > (35a) 

P > (35b) 

R>el (35c) 

A'^PA - P - KjiB'^PB + R)Kc +0 = 0; and (35d) 

{B'^PB+R)Kc + {B^PA+S^) = 0. (35e) 



An approximate match to the original control gain Kc can be obtained by using Algorithm [2] and setting 
S = a posteriori and calculating: 

7Cc = -dlqr(Ae,0,P). (36) 

The suitability of the newly synthesised controller for the given application is not guaranteed and would 
have be verified experimentally. 



Remark 1 Ttiis mettiod for minimising S is not suitable wtien ttie cost function (32^ tias been used for 
a plant with redundant inputs. The elements of R corresponding to redundant actuators that are not 
normally used will be forced towards infinity. 

Remark 11// the plant model has been augmented with a disturbance model, the disturbance states 
are uncontrollable. The cost-function re-shaping should be performed minimising only the cross-terms 
between inputs and the controllable states. It is clear that the cross terms between the disturbance states 
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and the input must remain, and in fact, tliis can be interpreted as an implicit input target calculator, as 
commonly used for offset-free MPC (Muske & Badgwell[ \2002j) . There is, however, still no guarantee that 



the remaining terms of S will be forced to zero. 

Remark 12 The cost functions and ^) have clear physical interpretations. On the other hand the 
results of Algorithm^are no so easily interpreted unless the elements ofS are small in relation to the 
other matrices or the controller synthesised with S artificially set to zero is acceptable. 



As an alternative, the method of ( |Di-Cairano & Bemporad[|2009t|Di Cairano & Bemporad[|2010[ ) can also 



be directly applied to increase the set of gains Kc that can be matched without cross-terms between state 
and input, although it is still not guaranteed that a solution will exist for arbitrary Kc. 

Proposition 1 If inverse-optimal cost weightings Q^ and Rq cannot be found for u = KcX{0) to be the 
optimum solution of the one step horizon control problem 

min {Ax{0) + Bu{0)f Q^ {Ax{0) + Bu{0)) + uIRqUq (37) 

u(0) 

then there is no sequence of{Qi, Rj) over any finite prediction horizon N which will give tv*(0) = KcX(O). 

Proof Any finite-horizon optimal control problem with a quadratic cost function (with, or without cross 
terms between inputs and states and different time steps) will have a closed-form matrix quadratic 
expression of form x^Px for the optimum cost. Therefore, if there is no suitable cost-to-go Oi for which the 
minimising control input of ( [37| is K'cX(O) there is no suitable quadratic function of the prediction horizon 
from /c = 1 to /c = A/ (for arbitrary N) either. ■ 



3.3 Constrained stability 



in the presence of constraints, the (unconstrained) stabilising properties of the original controller might 
not be inherited. The usual technique of applying a terminal constraint as in ( |Mayne et"ai7| |2000} can be 
applied directly if formal stability guarantees are required. 



Consider a finite horizon MPC controller using a stage cost function (|29), input constraints u{k) e U, state 
constraints x{k) e X, and a terminal constraint x{N) e T, where T is a recursively feasible positively 
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invariant set under Kc sucli tliat 

x{k) G T =^ Kcx{k) e U (38a) 

x{k) E T =^ x{k) G X (38b) 

x{k) G T =^ y(/c) G Y (38c) 

x(/() G T =^ (/A + BKc)x{k) G T. (38d) 

Assuming exact plant model matching, and that the observer error has converged to zero, the reverse 
engineered MPC controller can be interpreted as a special form of dual-mode MPC controller, with x{k) 
guaranteed to enter T in A/ steps. Inside T, by construction, the MPC controller is equivalent to the original 
LTI output-feedback controller, and therefore inherits its stabilising properties. 



4 Baseline controller transformations 

4.1 Discretisation 

Real-world systems operate in continuous time, and an existing controller might be specified in continuous 



time, but practical implementations of MPC operate in discrete-time with sampled data. In Maciejowski 



( |2007) it was proposed to find a continuous-time observer-compensator based realisation of an original 



continuous-time controller, to implement the observer in continuous time and then sample the output. 
If the gain used to form stage cost ([29| is the same as the continuous time state feedback gain, the 
closed loop system behaviour might be very different to that when using the original controller. Better 
output-performance matching can be achieved by finding an "equivalent" discrete-time cost weighting 
matrix ( [Van Loan[|1978| that minimises an integral cost function whilst being formulated as a discrete time 
problem as done in | |Maciejowski| [2007^ . This however, changes the effective gain of the unconstrained 
controller such that its rows are no longer in the row space of T. As a consequence, the n- dk modes 
corresponding to error dynamics in the nullspace of T will affect even the unconstrained closed loop 
system. This added complication constitutes a strong argument for discretising the baseline controller 
and the plant model first, and directly obtaining the discrete-time observer-based realisation. 

The usual zero-order hold method best models how a simple MPC controller would drive a real plant 
and this should be used for discretising the plant model. For the controller, it might be preferable to use 
a first-order hold or a Tustin transformation (e.g. ( [Franklin et al.[|1990 ^), particularly at low sampling 



frequencies. These can introduce non-zero terms even if none existed in the continuous time controller. 
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This process should, therefore, be performed before any further transformations are performed to comply 
with the restrictions of the observer-based realisations. 



4.2 Ensuring a strictly proper controller model 

When a predictor structure is used for the observer, Dk must be zero. Two options are available when Dk 
is non-zero. 



4.2.1 Loop-shifting 



Loop-shifting | |Zhou et alj |1 996) can be used (Figure [2^), leading to the following modified plant and 
controller: 



G(z) = 



Kq{z) = 



A+BDkC 


B ' 


c 






Ak 


Bk ' 


Ck 






(39a) 



(39b) 



However, the direct feedthrough component incorporated into the plant model uses the measured output. 



G(z) 



Dk 



Dk 



G(z) 
Ko(z) 



O7 Ko{z) 




(a) Loop shifting (b) Unit delay 

Figure 2: Techniques to ensure a strictly proper Ko(z) 

whilst an MPC prediction would use the observer output (the output values cannot be extrapolated over 
the prediction horizon without the estimates of unmeasured states). Input constraints might therefore be 
violated, or, control can be overly conservative when the observer error y{k) - Cx{k\k - 1) is large in 
magnitude. 
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4.2.2 Unit delay or low-pass filter 



Alternatively, adding a unit delay or a low pass filter in series with the original controller (Figure |2]d) 
prior to obtaining the observer-based realisation would have the desired effect. By avoiding a direct 
feedthrough, the MPC controller directly manipulates the plant input u{k) rather than an estimate of the 
input, avoiding uncertainty of the "real" value of input u{k). In this case, the conventional controller Ko(z) 
must be sufficiently robust, or the sampling frequency must be high enough for the delay to be tolerated. 



4.3 Ensuring correct zeros in controller 



When using a filter structure, Kq{0) = is required for correct reproduction. If this is not initially the case, 
the required zeros can be artificially introduced by adding a dipole on each channel of the form 

Wz 

(40) 



Wz-^ 



where 1/1/ is a "large" number, into the open-loop controller model. This introduces the required zeros 
whilst at the same time has minimal effect on open loop gains and phase shifts of the unconstrained 
controller (Figure [3). 




Figure 3: Bode plot of dipole gain-phase properties (Tg = 1 s) 
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4.4 Design guidelines 



The choice as to whether the predictor and filter form is most suitable depends upon whether the original 
controller is strictly proper or not, and if it is not, how large the value of Dk is, the length of the sampling 
period and the computational budget. The type of disturbances expected should also be considered. 
Loop-shifting might be considered appropriate if sensor noise dominates the model uncertainty. On the 
other hand if large external disturbances act on the plant, the error between the observer estimate of the 
output and the actual output could make enforcement of input constraints rather difficult, making the filter 
structure a rather more attractive prospect. 



5 Plant model transformations 



5.1 integral action 



Direct reproduction of a controller including integral action for offset-free control through the reverse 
engineering procedure would, by construction, reproduce the input/output characteristics. However, 
the presence of any unmodelled disturbance would manifest itself as a bias on each state estimate — 
problematic for a constrained predictive controller, as poor predictions made from biased state estimates 
will lead to overly conservative control action, or in the worst cases, control action that leads to infeasibility. 
The usual MPC methods of augmenting the plant model with a disturbance model fMuske & Badgwell[ 



2002t|Pannocchia[[2004| ; |Pannocchia & Bemporad[ |2007| can be used, subject to the new, augmented 
plant model being observable. Disturbance models also provide a convenient way in which the order of 
the plant model can be increased when of lower order than the original controller. 



Whilst, in ( [Alazard & Apkarian[|1999^ a method is proposed for finding an observer-compensator- Youla 
Parameter realisation of a controller with order higher than that of the plant model, the inclusion of 
disturbance models will improve the quality of the state estimates, and therefore the quality of the 
predictions in the optimisation. It is noted that there has been recent interest in using a Youla Parameter 
as a means of improving the robustness in constrained MPC ( [Cheng et am2009|[Thomsen et al.[|2010} , 
however the applicability of these methods to the reverse engineering procedure remains an open topic 
for investigation. 
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6 Selection of T and 7"^ 



The allocation of the closed loop poles between the observer error and the (unconstrained) state feedback 
dynamics is an important design decision when implementing an MPC controller in this manner. This 
turns out to be even more important when there is plant-model mismatch as a result of modelling error, or 
of linearisation error stemming from the common practice of using a local linearised model of a non-linear 
plant. 



6.1 Solution T 



There can be a marked difference in the observer error dynamics of different realisations, despite the 
complete controller being identical to the original Kq{z). As previously stated, in the presence of constraints, 
this error can result in a violation of constraints, or overly conservative control (depending on the sign of 
the error), because D/^y is directly fed back to the input of the plant, bypassing the MPC controller, whilst 
the MPC controller has to enforce constraints using an estimate, Cx from the observer. 

These types of error are particularly marked when using loop-shifting to make Ko(z) strictly proper. For 
useful predictions of the state trajectory to the obtained, the quality of the state estimation must be 
sufficiently high. Whilst intuition would suggest (subject to existence of a valid solution of T) that keeping 
the fastest closed-loop poles in the observer error dynamics might be sensible, our third example (Sec- 
tion [To]3} demonstrates that this is not necessarily the case for highly-coupled MIMO plants, particularly 



when using a highly-coupled MIMO plant model obtained by the widespread method of locally linearising 
a nonlinear plant. 

For SISO systems, and small MIMO systems, it is practical to evaluate every feasible solution of T and 
analyse the observer performance "by eye". However (assuming full observability and controllability of 
Go and Kq, no repeated poles (in which case, special care must be taken as to how the subspaces are 
partitioned, or all repeated poles must remain "together"), and no conjugate pole pairs), there are up to 
possible solutions for T. The combinatorial growth of possibilities with the system size therefore 
motivates the suggestion of a fast-to-calculate quantitative metric to rapidly assess the quality of the 
observer. 



Proposition 2 A good observer-gain realisation to use as ttie basis for an MPC controller is that which 
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minimises || Gy^e(z) ||2 x || Gc/^x(z) ||2 wliere, depending on tlie observer realisation 



Predictor form 



A-KfC 


K, ' 


or 


' A{l- 


KfC) 


AKf 


C 


-1 


_ C{l- 


KfC) 


CKf-l 



Filter form 



(41) 



and 



{Z) = 



A- 


KfC 



/ 




A{l- 


KfC) 



/ 






or 






1 








1 











/ 


-/ 







/ 


-/ 



(42) 



Remark 13 Thie term Gy^g is concerned with the effect of measurement noise on the fiitered estimate of 
the measured outputs. Unmeasured states are deliberately omitted in this metric, as no assumption can 
be made regarding the actual values of the unmeasured states, and therefore on the state error The term 
Gfj^x considers the effect of unmeasured (but acknowledged, if not modelled in detail) disturbances on 
the estimated state. Ideally this value should be small. The two terms are multiplied rather than added, 
because no assumption can be made regarding their relative magnitudes. The T-L2 norm is chosen over 
the Tioo norm because the search is being performed over a discrete set, and the latter metric represents 
a "worst case" gain, effectively "hiding" any other behaviour. 

Remark 1 4 The choice of P will affect the chosen metric. Therefore, a simple heuristic for choosing 
this should be decided before performing the search over all feasible combinations T. This is provided 
subsequently 



6.2 Designing V 

The extra dynamics introduced as a consequence of the observer being of higher order than the original 
controller affect the MPC controller performance, despite their associated modes being in the nullspace of 
the initial calculated Kc and thus "invisible" to the plant (in which case ensuring Ae is stable is sufficient 
Pelmond et al.[|2006) ). The solution to a linearly constrained MPC problem with linear or quadratic cost is 



piecewise affine with respect to the current state ( [Bemporad et al.[[2002) . When constraints are active the 



gain component of this function changes, and the observer error modes which were previously invisible in 
the closed loop system will stop being insignificant. 



The examples in Section [To] indicate that placing these "free poles" using Kalman filter methods on 
system ( [2T| rather than attempting to place them near the origin as might be expected to give "fastest" 
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convergence is worthwhile to avoid amplification of noise. As one would reasonably expect, the choice of 
weightings ultimately depends on the disturbances that are expected, but a relatively high measurement 
noise covariance matrix (giving slower observer poles) appears to give the best results. 



7 Reference tracking 



Obs 



MPC 



G(z) 



MPC 



G(z) 



Hpre(z) 



MPC 



G(z) 



(a) Error observer 



I Obs < 1 

(b) Option 2 

Figure 4: Reference tracking 



Obs <r- 



(c) Option 3 



An LTI compensator is often placed in the forward path of a feedback loop rather than the return path. 
The dynamics of the compensator therefore act upon the difference between the reference signal and 
the output signal rather than just the output signal. However, to obtain an estimate of the plant state an 
observer would be placed in the return path. 

Three options are suggested here. The first is to implement the observer in the forward path, and use 
the observer to estimate the error between the plant state and a reference state. Because the prediction 
model in the MPC controller would therefore predict the trajectory of the tracking error rather than the 
plant state, this is sufficient for handling input constraints, but not output or state constraints. 

The second option is to simply implement the observer in the return path and accept a change in the 
transient response to input changes (the disturbance rejection properties will remain unchanged). 

A third solution is to implement the observer in the return path, and to pre-filter the output reference 
set-point with a modified copy of the observer. For the unconstrained case with unmodified controller gain 
Kc, an adequate pre-filter for the predictor form (assuming loop-shifting has not been used) is: 



A-KfC 


Kf ' 


1 






(43) 



Loop-shifting must also be reflected in the reference-tracking structure (Figure [5). The signal Dkt must be 
known to the MPC controller to make correct predictions. The prefilter with loop-shifting used should be: 



Reference: CUED/F-INFENG/TR.671 
Date: 13/09/2011 
Issue: 1 Page: 28 of|56 



Dk - 
Ko{z) FO-^ 
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G(z) 



Dk 



Dk 



Ko{z) 



-O7 Ko{z) 



Dk 



'pre(z) 



MPC 



o- 



G(z) 



Dk 



G(z) 



OBS 



(a) Baseline implementation (b) IVIPC implementation 

Figure 5: Loop-shifting witli reference tracking 



Hpre(z) 



A+{BDk- Kf)C 


Kf - BDk ' 


1 






(44) 



However, tliese pre-filters are not unique, because any signal in tlie null-space of Kc can be added to the 
output of this system yet give identical (unconstrained) closed-loop results. These degrees of freedom can 
therefore be used to force certain elements of the state reference Xr = Hpre{z)r to be equal to elements of 
the original reference signal, or to force certain elements of the state reference to always be zero. The 
latter is useful if one desires that the reference setpoint not include any open-loop unstable directions. 
Letting r{k) be the original reference signal, and Xpre(/c) be the state of system (|43| then the "C" and "D" 
matrices of (43i can be chosen so that the state reference signal Xr{k) satisfies (for some U e m("""")^", 
and some L2 £ m("-"")x"^): 



Kc 





Kc 



''pre 



(/C) + 



L2 





r(k). 



(45) 



Therefore, assuming that Kc is of full row rank, an equally valid choice of prefilter is: 



Hpre(z) = 
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- KfC 


Kf 




u 


-1 









U 


-1 








Kc 




Kc 






Kc 










(46) 
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Alternatively, if loop-shifting has been used, 





A + BDkC 


-KfC 


Kf 


-BDk 


Hpre(z) = 




/-1 


-1 









/-I 


-1 










Kc 




Kc 






Kc 










The same principle could also be applied to a pref liter implemented to provide Xr{k + 1 |/c) at time k. 



8 System realisation using standard tools 



8.1 Pre-stabilisation 



The stage cost function ([29) is unusual in that it includes cross terms between the predicted state and 
the predicted input at each time step. Whilst QP matrices for a finite-horizon control problem can easily 
be constructed manually, this structure is not always directly supported by standard MPC design and 
implementation software toolchains. However, prestabilisation ( [Rossiter et al.[|1998"| > can be used. Letting 
u{k) = KcX{k) + r]{k), a change of coordinates transforms the stage cost ([29} into 



i{x,rj) 



X^ 



"o 


o" 




X 





R 




_V_ 



and the prediction model 



x{k + ^) = {A+ BKc)x{k) + Brj{k). 



Input constraints can then be imposed as cross-constraints between inputs and states 
constraints on an artificial plant model with a non-zero "D" matrix. 



(48) 
(49) 

i.e. as output 



8.2 Delay management 

Realising the discrete-time predictor structure in Simulink in a way that could be deployed is simple. At 
time k, x{k +'\\k) should be used to calculate control action u{k + 1 \k). This will then be delayed by a 
period 7s by a "Rate Transition" block configured for "deterministic data transfer" before being applied to a 
continuous-time plant. 
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From a practical perspective, to discrete-time realise a filter structure directly in Simulink requires that 
"Rate Transitions" between a continuous-time "real world" and the discrete-time controller are configured to 
not enforce deterministic data transfer, and that a transport delay is added in the "cut" shown in Figure |6} 
Unlike in an unconstrained "observer-based" controller, where Kc is fixed and can be included directly in 
the observer dynamics, the control move u from the MPC controller must be fed back to the observer after 
calculation. 



G(s) 



Zero order hold 



6 Kc if unconstrainec 



Jl 



Jl 



MPC i OBS « 



Figure 6: Delay (before observer) 



Some delay to account for computation time and to avoid a computational algebraic loop is inevit- 
able. As well as not being a suitable configuration for controller deployment, this leads to a full 
unit delay on input signals fed back to the observer, because the zero-order hold sampling time 
will be "missed". As an alternative, a delay of Tg could be re-introduced everywhere, but if one 
has chosen to use the filter-form observer structure, it has likely already been established that this 
would be unacceptable. However, to ensure deterministic data transfer whilst not requiring a full 
unit delay, multiple sampling rates and conditionally executed subsystems can be used (Algorithm [3j. 
Algorithm 3: Multi-rate system for deterministic transfer 

Data: k, A/div 

begin 

1 Let ? = kTs 

2 Sample y(f) 

3 Calculate x{k\k) = (/ - KfC)x{k\k - 1) + Kfy{t) 

4 Start calculation of MPC control action 

5 t ^ kTs+ Ts/A/div 

6 Output MPC control action u{t) 

7 Use MPC control action u{t) to calculate 

x{k+^\k) = A{l - KfC)x{k\k - 1 ) + Bu{t) + Kfy{t - Ts/N^w) 

8 Wait untiU = (/c -I- 1)Ts. Increment /c. 
end 
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9 Cross coupling 



When using the reverse engineering method, under certain circumstances it is possible to inadvertently 
find an observer gain which introduces coupling between the state estimates of supposedly separate 
subsystems, and a state feedback gain that "removes" the cross-coupling. Consider a closed loop system 
comprised of m identical, parallel, decoupled loops with the plant and controller 



G(z) = 



^1 














Bm 













Cm 








(50) 



K{z) = 
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^Km 




BKm 













CKm 








(51) 



respectively, where A-^ = = ... = Am, B^ = = ... = Bm etc. This is equivalent to m identical, 
independent, closed loop systems. The reverse engineering process aims to cast the controller into an 
observer form, with a state feedback matrix, Kq and an observer gain matrix, Kf. It would therefore not be 
unreasonable to expected that Kc and to also be block diagonal: 



cm 



Kf 



Kf 



fm 



Reverse engineering such a structure would, of course, be equivalent to reverse engineering each of the 
identical subsystems individually. Unfortunately, it is not always the case that this structure is obtained, 
and whilst the nominal input-output characteristics remain decoupled, the reverse engineering procedure 
can introduce internal cross-coupling between the loops through a poor choice of U 



in 1 24 . 



Theorem 8 The eigenvalues of matrix A + BKc correspond to eigenvectors defined by thie columns of U-\ , 
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when the basis for the invariant subspace comprises a selection of eigenvectors of Ad. 
Proof From Theorem |6l consider 

A + BCkT = A+BKc= U:AU;\ 



(52) 



If the diagonal elements of A are the poles of {A + BKc), the columns of U^ are the corresponding 
eigenvectors. ■ 

Lemma 2 The soiution T to ^ or (7^, obtained using the method described in Section\^is unique for a 



given basis Im 



, and is not dependent on the scaling, nor the ordering of the columns. 



Proof Let the columns be transformed by a full rank n x n matrix, X. 









x = 


'U^X 










U2X 



(53) 



Then, 



t=U2U^^ = U2XX^ U^^ = T. (54) 

Therefore, T depends on the span of the invariant subspace S, not on its specific representation. ■ 

Definition 3 (Eigenspace) An eigenspace is the maximal invariant subspace corresponding to a particu- 
lar eigenvalue of a matrix. 

Lemma 3 Assuming no repeated poles within each of the independent loops, there are n = dim(/A/) 
distinct eigenvalues, each of which corresponds to an m-dimensional eigenspace ofA^i — i.e. any linear 
combination of the basis vectors that define the eigenspace is a valid eigenvector 

Proof By construction. Consider each subsystem separately. ■ 

Lemma 4 A sufficient condition for the reverse engineered system to be decoupled is for T to be block 
diagonal, with each block corresponding to an individual subsystem. 

Proof By construction, Ck is of the form 



'K1 



(55) 
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Therefore Kc = CkT is of the form 



c1 



7-11 ••• 7i 



CK^T^^ C/<-iTi2 
Ck2 721 Ck2 T22 

_CKmTm^ CKmTm2 

Therefore, Kc will be decoupled when T is of the form 

"Tii 



1m 
Tmm 

CK^ T^ m 

Ck2 T2m 



(56) 



(57) 



(58) 



(59) 



Remark 15 This is analogous to reverse engineering each of the loops individually. In this case, 
and U2 would also be block diagonal. Intuitively, given that the original controller is being viewed as an 
observer on Tx, it makes sense that the loops should not affect each other 

Proof Remembering that T = U2U^\ if U-\ and U2 are block diagonal in a compatible fashion, then T will 
be block diagonal: 



'11 



'21 



'2m 



U^m 



(60) 



This is effectively reverse engineering the decoupled systems individually. ■ 

Lemma 5 It is not necessary that and U2 are block diagonal for T to be block diagonal. 

Proof As proved in Lemma[2} the solution T only depends upon the choice of invariant subspace, not its 
representation. Therefore, to obtain a decoupled solution, it will suffice that there exists a matrix X such 



Reference: CUED/F-INFENG/TR.671 
Date: 13/09/2011 
Issue: 1 Page: 34 of|56 



that 



U2 



X 



U2^ 



(61) 



U2r 

Therefore, it is not necessary for U-[ and U2 to have any particular structure, merely, for it to be possible to 
construct the desired structure through linear combinations of the columns. ■ 

Proposition 3 If the invariant subspace U used to solve the non-symmetric Riccati equation is constructed 
from a set of complete eigenspaces, inappropriate cross coupling will not be introduced. 

Remarl< 1 6 /n a decoupled system, with m identical subsystems, each distinct eigenvalue will have 
associated with it an m-dimensional eigenspace. Due to the construction of the original system, it is 
possible to interpret each of the m dimensions as corresponding to each of the original decoupled 
subsystems. Therefore, the conditions on and U2 required in Theorem^ will be fulfilled automatically if 
the eigenspaces are not split. 

Proposition 4 If the m-dimensional eigenspaces are only partially used when choosing Uj , 
cross-coupling in Kc is not inevitable under some circumstances. Whether cross-coupling is introduced 
depends on the manner in which the m-dimensional invariant subspaces are split. Consider one of the 
m-dimensional eigenspaces, with basis 



^■2,1 



V2 



(62) 



Because V-\ is of rank m, there exists a m x m transformation matrix Y such that 



v^^ 1/1,2 



V^,m 



Y = 





• 


• 





V2 ■ 


• 





• 


• 





• 





(63) 
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The columns can then be freely exchanged by postmultiplying by a permutation matrix. A subset of the 



columns of 



Y can then be used to form part of the invariant subspace of Ad, 



without causing coupling between the nominally independent loops. 



Theorem 9 If the m-dimensional eigenspaces are split when choosing Uj Uj , such that in neither 
of the resulting subspaces is it possible to obtain a decoupled structure through linear combinations of 
their respective bases, cross-coupling will introduced. 



Proof There exists a transformation matrix Y such that 



Y = 





• 


• 





V2 ■ 


• 





• 


• 





• 





(64) 



wliere r is dense. In tinis scenario, if fewer tlian m columns of 

T 



Y are used to construct 



selected from 
of the columns 



, cross-coupling between the independent loops will be introduced if the number of columns 

T 

Y is fewer then the number of loops for which a non-zero element exists in any 



Remark 1 7 /n other words, cross coupling will be inevitably introduced if any of the eigenspaces is split in 
such a way that neither of the resulting subspaces can be represented in a way that separates the loops. 



Corollary 2 span ( U( L/J 
introduced in Kc. 



only contains complete eigenspaces of Ad, cross-coupling will not be 



Proof If none of the n, m-dimensional eigenspaces are split (i.e. they are used to construct U in their 
entirety or not at all), the pole allocation between observer and feedback will be equivalent to the reverse 
engineering, then recombination of each of the loops individually. Suppose that cross-coupling between 
the loops is introduced. Then, there needs to be a representation of the invariant subspace of Ac\, 
Uj ul such that cross coupling is introduced. However, for any given invariant subspace for which a 
solution T exists, T is unique. A decoupled solution is known to exist, and this must, therefore, be the 
only solution. ■ 
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10 Case Studies 



10.1 Spacecraft Attitude Control — Fault robustness using redundant actuators 



This example demonstrates the procedure on the sampled-data output-feedback controller from (Sidi 



1997, ex 9.4.1) for a single axis attitude control system with angle measurement only, and shows how the 



implicit daisy-chaining of constrained MPC (I Maciejowskij |1 99 81 can be exhibited by a reverse-engineered 
controller. A continuous-time linear model is used for simulation, and its zero-order-hold discretisation 
used for prediction. 



10.1.1 Model 



The plant is modelled as an ideal inertial load with a moment of inertia J = 500 kg m^. To demonstrate 
how reverse-engineered MPC can be used to add extra functionality to the original controller, a redundant 
torque pair input is added, and the model is also augmented with a constant torque disturbance state. 
The sampling period Tg = 0.25 s. 





1 


0.25 


0.00358 


0.00358 


0.00358 ' 


G{z) = 





1 


0.02865 


0.02865 


0.02865 


















1 










0.01745 















(65) 



10.1.2 Baseline controller 



The baseline controller does not use the second torque pair. 





' 1.412 


-0.8235 


32 


Ko{z) = 


0.5 








13.01 


-26.14 


-871 














(66) 



and has poles at 1 (integral action), 0.41 and zeros at 0.98 and 0.91 . 
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10.1.3 Observer-compensator realisation 

The magnitude of the Dk term is large in comparison to other terms in the controller matrices, and 
the controller is being designed to counteract external disturbances, so the estimate of y may have 
significant error during transients caused by these disturbances. Figure [7] shows the observer error 
dynamic responses for each of the possible realisations when loop-shifting is used — it is clear that the 
error is substantial. Loop shifting is therefore not considered an option for this system if input constraints 
are to be enforced. Adding a unit delay is also not an option because it noticeably changes the system 
response step response to the disturbance (Figure [8). 



-0.08 
-0.1 - 











Angular velocity (rad) 

- - - Angle (rad) 

Disturbance (Nm) 



-0.08 
-0.1 - 



10 15 20 25 

Time /s 



-0.08 
-0.1 



Figure 7: Observer error dynamics using loop-shifting 



0.035 
0.03 



_ - Angular Position (deg) 
_ Kg - Angular Velocity (deg/s) 

- UStiifl - Angular Position (deg) 

- UStiifl - Angular Velocity (deg/s) 
Delay - Angular Position (deg) 

■ Delay - Angular Velocity (deg/s) 




20 25 




Figure 8: Comparison of loop shifting, unit delay and Kq closed loop performance 

The remaining option is to obtain a filter-form observer-compensator realisation. Whilst Kq{0) 7^0, so a 
dipole can be added to make this condition hold: 



K,{z) = Ko{z) 



50z 

50z- 1 



(67) 



Reassuringly, the closed loop poles do not move significantly (Table Q}. There are three plant states 
(including the disturbance), and, now, three controller states. The disturbance state is uncontrollable, and 
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Table 1 : Closed-loop poles for unconstrained attitude control system 

Original With Dipole 

1 1 
0.9764 0.9764 
0.9115 ±j0.11 92 0.9086 ±j0.1 204 
0.5579 0.5660 
0.0177 



the complex pole pair cannot be split, so there are four possible choices of T giving the realisations in 
Table[2]— "S" is used to mean a pole placed in the state feedback, and "O" the observer error dynamics. In 
reality there will be a small transport delay between measurement and application of the control action 
due to computation not being instantaneous. For this reason, the phase margin and the delay margin, 
having broken the loop just after the MPC calculation (but before feeding back to the observer and plant) 
are considered. 

Table 2: Observer-based realisations for attitude system 





R1 


R2 


R3 


R4 


0.0177 





S 


S 





0.5660 





S 





S 


0.9086 + j0.1 204 


s 











0.9086- jO.1 204 


s 











0.9764 








S 


S 


1 


s 


S 


S 


S 


Delay margin 


4.36 7"s 


0.51 Ts 


0.98 7"s 


2.837s 


Gain margin 


11.67 


1.66 


2.01 


4.42 


\\Gy^e{z)\\2 


29.95 


12.31 


18.89 


89.05 


l|Gtf^x(^)l|2 


5.03 


5.59 


3.17 


2.99 


||Gr^ell2 X llGd^j(||2 


150.5319 


68.7844 


59.8672 


89.0512 



Superficially, the data from Table [2] suggests that realisation R1 is a sensible starting point for development 
of an MPC controller. The fastest poles are in the observer error dynamics, and the delay margin and 
the gain margin at the artificial "cut" are excellent. Furthermore, if implemented using a single discrete 
sampling rate (e.g. a basic implementation in Simulink), realisations R2 and R3 become unstable because 
the inevitable delay causes a zero-order hold sampling time to be "missed" introducing a whole unit delay. 

However, Proposition |2] suggests something rather different. In fact the observer in R1 is very slow to 
estimate unmeasured disturbances (resulting in large errors in the estimates of other states). R4 does not 
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have this issue, however, it amplifies measurement noise (albeit absent in this example) in an unpleasant 
manner. Therefore, the best overall realisation is R3. If the information flow is managed carefully (e.g. 
using multiple sampling rates in Simulink, as proposed in Algorithm|3}, the computation delay need only 
be a small fraction of the sampling period, and can be moved to "after" the point at which the calculated 
control action has been fed back to the observer, but before the plant — thus recovering the behaviour of 
the original controller even for R2 and R3, and rendering the calculated "delay margin" at the original "cut" 
(Figure [6} irrelevant. 



10.1.4 MPC implementation - Cost function and constraints 



Table [3] enumerates the controller configurations used to produce the closed loop responses to the step 
disturbance depicted in Figure[9) When cost function ([29) is used, R = I. When cost function (|32) is used, 
R^ = 10^/ and Q-\ = 10^/ to indicate a strong weighting for matching the "effect" of the control action on 
the state and a weak weighting on the exact original control configuration. 

Table 3: Reverse engineered MPC with input constraints 



Ko 



Realisation 

Cost function - 

Prediction/Control Horizon - 

Constraints - 



Case 1 
R3 



Case 2 
R3 



29 



29 



15 



15 

\Ui\ < 0.11 



Case 3 
R3 
32 
15 

\Ui\ < 0.11 




Figure 9: Reverse engineered MPC with input constraints 



Figure |9] shows that in unconstrained MPC Case 1 , the matching with the behaviour of the original output 
feedback controller is close although there is a slight difference, attributed primarily to a Ts/^0 delay 
introduced to ensure deterministic data transfer {Kq is assumed to be implemented instantaneously). In 
MPC Case 2, input constraints are deliberately imposed at a level lower than the peak of the unconstrained 
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input trajectory. Whilst tine second torque pair can be seen to boost tine correction of tine disturbance, 
tiiis is sliglitly slower than the unconstrained case. Nevertheless, this demonstrates that the objective 
embedded in the cost function ([29| could be adequate in this scenario. In MPC Case 3, where the cost 
function better encodes the consequences of the control action rather than its specific realisation, the 
output trajectory is identical to the unconstrained case, with the second torque pair being used to exactly 
match the net torque trajectory of the unconstrained case. 



10.1.5 MPC implementation - Output constraints and fault recovery 



In MPC Cases 4 and 5, an output constraint is also added to constrain the angle y < 0.01 , representing an 
attitude pointing requirement. To ensure feasibility of the optimisation problem despite observer estimation 
error, these are softened with a quadratic weighting of 10^. In Case 4, the input constraints are relaxed, 
and in Case 5, an unmodelled plant failure is introduced at f = 3 s. 

Table 4: Reverse engineered MPC with state constraints 



Ko 



Case 4 



Case 5 



Realisation Ko R4 R4 

Cost function - 29 32 

Input Constraints - < 1 < 0.15 

State Constraints - \y\ < 0.01 |y| < 0.01 

Torque 1 failure - - 3 s 




Figure 10: Reverse engineered MPC with state constraints and unmodelled plant failures 



Because of the constraint softening and the inevitable state estimation error, constraints are not enforced 
exactly, however. Figure [TO] shows that the violations are not large. 
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10.1.6 MPC implementation - Tools 

The constrained reverse-engineered IVIPC controller is implemented in Simulink using a custom condensed 
QP builder that accommodates cross-terms between input and states (i.e. equality constraints for 
state dynamics are eliminated to form a dense QP). The QP is solved using an Embedded MATLAB 
implementation of the dual active-set algorithm of ( [Goldfarb & ldnani[[T983[ . The prediction horizon is 15, 



with 4 input constraints per time step, and 2 (softened) output constraints per time step, leading to a QP 
with 45 decision variables (the slack variable is shared between the upper and lower bound constraints on 
outputs) and 90 inequality constraints. Measured using the Simulink Profiler tool, the QP solver takes on 
average 0.3 ms to solve on a 2.8 GHz Mac Pro running MATLAB R2010b on Scientific Linux 6 in a virtual 
machine using a single core. This is small in comparison to the 7"s/10 = 0.025 s that has been allowed for 
computation in these simulations. Assuming an approximately linear scaling with clock speed, this means 
that a 40 MHz processor could be sufficient to implement the reverse-engineered constrained controller in 
real-time. The subdivision of the sampling period to approximate the direct feed-through is therefore a 
demonstrably practical option. 



10.2 Inverted Pendulum on a Cart — Output constraints with an unstable plant 

This example demonstrates the procedure when a local linearisation of a nonlinear model is used for 
prediction, and shows that output constraints can be enforced as with conventional MPC design. Unit 
(1 m) step changes in the cart position reference are tracked using the method presented in Section[7| 
whilst maintaining pendulum stability. 

9, 9 

X, X \ 

' \ J — ^ 

Figure 11: Cart-pendulum model 



10.2.1 Model 



A non-linear continuous-time plant model is used for simulation, whilst a linearisation about the unstable 
equilibrium point is used for prediction. The model data and the baseline controller are taken from 
( [Goodwin et al.[[2001| ). The pendulum mass m = 0.5 kg, cart mass M = 0.5 kg and pendulum length 
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/ = 1 m. The system input is a force applied to tine cart, wliilst tine system measured outputs are (in order) 
tile position of tlie cart and tine angular deviation from vertical of the pendulum. The plant states are 
cart position x, cart velocity x, pendulum deflection 9 and pendulum angular velocity 9 (Figure [lT|- The 
nonlinear model equations in state space form are: 



{M + m)x + ml9 cos 9 - ml9^ s\n9 = u 

-19 + gs\n9 = xcos9. 

Rearranged into a nonlinear state-space form: 

m/^^ sin 6* - ATig sin ^ cos 6* + t/ 
{M + ms\n^9) 
■■ osin6' - XC0S6' 

— I • 



(68a) 
(68b) 



(69a) 
(69b) 



The continuous-time linear model is obtained by linearised about the upwards-facing equilibrium point, is: 



G(s) = 






1 

















mg 
M 





1 

M 











1 











{M+m)g 
Ml 





1 

Ml 


1 




















1 









(70) 



10.2.2 Baseline linear controller 



The baseline controller is specified in continuous-time as a MISO transfer function. Assuming positive 
feedback, 



Ko{s) 



4(5+0.2) 150(5+4) 
(s+5) (5+30) 



(71) 



10.2.3 Reverse engineered controller 

The baseline controller Ko(s) must be discretised before proceeding. This leaves a choice for the value of 
Tg. Kq has two stable, real poles of frequency 5 rad s^'' and 30 rad s ^ The linearised plant model has 
two poles at the origin of the s-plane, one unstable pole of frequency 4.43 rad s^ and a symmetric stable 
pole. The objective is to reproduce the original system response rather than to just stabilise Go(s), so we 



Reference: CUED/F-INFENG/TR.671 
Date: 13/09/2011 
Issue: 1 Page: 43 of|56 



choose Ts = (27r)/(2 X 30) « 0.1 s. Discretised using a Tustin transformation, tine discretised baseline 
controller is: 





0.6 





3.2 





Koiz) = 





-0.2 





25.6 




-0.384 


-2.438 


3.232 


72 



Once more Dk is nonzero (as would also be the case if a zero-order hold were to be used). The baseline 
controller does not tolerate a delay of Tg added in the loop. (Using the continuous time controller, this 
leads to oscillations. In discrete time, it leads to instability.) A transport delay of 0.01 s is tolerated though. 
(The sampling period could, of course, be reduced so that a delay of a smaller Ts is tolerated, however 
this can cause a longer prediction horizon (in time steps) to be needed to obtain stability when constraints 
are imposed, meaning a larger optimisation problem and heavier computational requirements.) 

Assuming reduction of the sampling time is not acceptable, there are two options — loop shifting (obtaining 
a predictor form observer), and dipole introduction (obtaining a filter form observer). In this application, 
large external disturbances are not expected, and input constraints are not being considered. Loop shifting 
will allow more time for computation — DKy{k) can be calculated very fast in comparison to solving the 
MPC QP, and the QP is formed using estimate x{k\k - 1) so can commence at the previous time step. 
(The loop shifting could even be implemented in continuous-time before discretisation). 

There are 6 closed-loop poles in the original system (n = 4, hk = 2). The 2 (initially invisible) additional 
modes introduced in the observer are placed by designing a Kalman filter for system ( [2T| with Q = 1 
and R = 10^/ — i.e. assuming that most uncertainty comes from measurement noise. The possible 
realisations are shown in Table [5j Realisation CPR2 is chosen, having lowest gain from the output to 



Table 5: Cart-pendulum closed-loop poles and relisations 





CPR1 


CPR2 


CPR3 


0.2416 + j0.5304 


S 





S 


0.2416- j0.5304 


S 





S 


0.7832 + j0.0630 


S 


S 





0.7832 - j0.0630 


S 


S 





0.8800 





S 


s 


0.9708 





S 


s 


New poles 


0.354 ±j0.624 


0.242 ±j0.530 


0.515±j0.763 


\\Gy^y\\2 


19.61 


3.62 


6.59 



the estimate of the output. 
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10.2.4 Constrained MPC realisation 



Due to the loop-shifting, the state in the prediction model is augmented with the signal D^-r, as this 
is required for correct prediction and enforcement of input constraints when the reference setpoint is 
non-zero, as well as the cost function (|29l is modified to penalise - Kc{x - Xr))\\2- 



Because Ko(z) was in the forward path, pre-filtering is necessary when the observer is in the return path. 
The form (|47) is used to ensure that the pendulum angle and angular velocity are not included in the state 
reference trajectory Xr. The control gain Kc is of size 1 x 4, so it is possible to force 3 states in the filtered 
reference state to be equal to arbitrary values. The matrices 



10 
10 
1 



L2 









(73) 



constrain the cart reference velocity, and the angular state references to be zero. All shaping is done 
using the cart position reference. Figure [12] shows three prefilters. Prefilter 1 is implemented using ( [44} . 
Prefilter 2 is implemented using ([47) and the values of U and L2 above. 

Remark 1 8 /An alternative would be to set 



U = 



10 
10 
1 



/-2 = 



1 





(74) 



causing the position reference to be equal to its nominal setpoint value, and angular state references to 
be zero. Reference shaping is performed using the cart velocity reference. 



Prefilter 3 uses and L2, chosen as in Remark[T8] Prefilter 2 is used for MPC simulations because any 
point of the filtered reference trajectory is a stable equilibrium. 

Remark 19 /\ predictor-form observer is used to estimate x{k + ^\k), from which the MPC determines 
a control action to apply at time k + ^. At time k = the MPC cannot contribute to the control action. 
Figure\l^shows a zero-order hold plot of the pendulum angular velocity for the first 2 s of simulation. The 
direct feedthrough due to loop-shifting acts att = 0, violating constraints att = 0.^. However, the MPC 
controller quickly corrects this byt = 0.2 and thereafter If a filter structure with a dipole had been used, 
this behaviour would not be exhibited. 



With a prediction horizon of 20, 2 input constraints and 6 output constraints and 3 slacks for constraint 
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Cart pos. ref. 





■ Prefilter 1 

■ Prefilter 2 
Prefilter 3 



5 10 15 20 
Time /s 
Ang. ref. 



5 10 15 
Time /s 



20 



5 10 15 20 
Time /s 



1.5r 



0.5 


-0.5 



Ang. vel. ref. 



IT 



5 10 15 
Time /s 



20 



Figure 12: Cart-pendulum prefilter realisations 





Figure 14: Initial delay 
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Table 6: Cart-Pendulum simulations 



Ko(s) 



Ko(z) 



Case 1 



Case 2 



Ts 

Realisation 
Prefilter 
Horizon 



Continuous 0.1 s 



0.1 s 
CPR2 



47 / 73 
15 



0.1 s 
CPR2 



47 



A 73 
15 















' 0.7 ■ 




Constraints 






None 


1^1 


< 


0.175 












_\e\_ 




0.3 





softening per time step, a dense QP with 60 decision variables and 90 constraints is formed. With the 
same setup as described previously, a solution time of 0.25 ms is possible. Linear scaling indicates that 
this could be implemented on very modest hardware indeed. 



10.3 Control of a Large Airliner — MIMO control of a highly cross-coupled plant 

The third example extends ( [Joosten & Maciejowski[ |2009| by using Proposition [2] as the heuristic for 
choosing a solution T that yields a suitable observer realisation. Furthermore, in the case of plant failure, 
the cost function (|32) better encodes the contingency objectives than (|29} whilst retaining fidelity in the 
nominal case. 



10.3.1 Model 



The plant model and baseline controller is provided by the simulator of a Boeing 747-100/200 that 
was used by the fault tolerant control group (AG16) of the Group for Aeronautical Research Europe 
(GARTEUR) dvan der Linden||1996t[Maciejowski & Jones| [2003| [Joosten & Maciejowski||2009t [Edwards 
et al.[[2010t[van der Linden et aTj[2011[ ). Of interest for this demonstration, are 14 plant states (excluding 
lateral position) and 27 individually addressable control surfaces, consisting of 4 ailerons, 12 spoiler 
panels, 2 rudders, 4 elevators, 4 engines and a stabiliser. 



The measured outputs are: roll rate (p), pitch rate (q), yaw rate (r), true airspeed (\/tas)' '"oH angle ((/>), 
pitch angle (6), yaw angle (V'), height (h) and rate of height change (/?). 



The prediction model is obtained by averaging two empirically linearised models (linearised with opposite 
sign deflections) about a trimpoint obtained in continuous time using the Simulink linmodvS command to 
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Table 7: States and Inputs 



(a) States (b) Control surfaces 



State 


Symbol 


Unit 


ID 


Description 


Roll rate 


P 


rad s"^ 


1 


Right inboard aileron 


Pitch rate 


Q 


rad s"'' 


2 


Left inboard aileron 


Yaw rate 


r 


rad s^'' 


3 


Right outboard aileron 


True airspeed 


^TAS 


m s^'' 


4 


Left outboard aileron 


Angle of attack 


a 


rad 


5-10 


Right spoiler panels 


Sideslip 




rad 


11-16 


Left spoiler panels 


Roll 


4> 


rad 


17 


Right inboard elevator 
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obtain an empirically linearised model about a trim point. To further avoid the effects of nonlinearities, 
interaction between lateral and longitudinal states is explicitly nullified in the linearised model by setting the 
relevant elements of the state update matrix to zero. The open-loop poles of the linearised continuous-time 
system range from to 0.18 Hz. The baseline controller in ( [van der Linden et aTj |201 1 [ ) is a family of 
switched single-loop linear controllers with parameter-varying gains and rate limits and saturations. To 
simplify the reverse engineering, the altitude select in series with pitch select, heading select in series with 
roll select, airspeed controller and yaw damper have been extracted, implemented as a block-diagonal 
MIMO controller and linearised about the operating point (i.e. with the scheduled gains locked to their 
values at the trim point). 



The plant and baseline controller are both discretised with a sampling period Tg = OA s. The plant is 
discretised using a zero-order hold and the controller with a Tustin transformation. The linear model 
dimensions are provided in Table [8] 



Table 8: Plant sizes 
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10.3.2 Reverse engineered controller and observer performance 

Whilst the original control system is implemented as four separate loops considering height, roll, yaw 
damping and air speed separately, these loops are not wholly independent, being coupled by the plant 
dynamics. 

Figure[l5]shows the non-zero elements of the discretised state-update matrix, and also shows a realisation 
in a re-ordered form to better show the structure of the system. The original control loops are also coupled 
by the inputs. 



Nonzero elements of Nonzero elements of reordered 




5 10 15 5 10 15 

State State 



Figure 15: Cross terms in linearised B747 model 

The baseline controller includes a number of integrators for offset-free tracking. To cast this into an MPC 
framework, the linearised model is augmented with 7 disturbance states, modelling constant disturbances 
added to the first derivatives of p, q, r, I/jas- angle of attack (a), 6 and ip. This is as many as can be 
tolerated whilst maintaining observability of pair {C,A), and the chosen combination has been chosen 
experimentally to give the best observer performance. The baseline controller is not strictly proper 
(K'o(O) ^ 0), and given the number of states and inputs, and therefore the relatively high complexity of 
solution of the MPC problem, adding a unit delay is the chosen method for obtaining a strictly proper 
realisation. 

There are up to ^^C-\j possible realisations. However, the 7 uncontrollable disturbance states must stay in 
the state-feedback dynamics, reducing this to ^^C^7 ^ 2.7 x 10^. Further reductions in this number can 
be made by noting that 9 of the closed-loop poles are complex conjugate pairs, reducing the number of 
combinations to 431415. A further reduction can be obtained by noting that the system 
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has a further three uncontrollable modes in addition to the disturbances, corresponding to three poles at 
z = 0.9512 (so these must remain in the state feedback dynamics), reducing the number of combinations 
to a somewhat more tractable total of 41958. The search using the metric in Proposition |2]can be carried 
out in 181 s using a script written using standard MATLAB Control Toolbox commands for analysis of LTI 
systems on a modestly specified desktop workstation. Of the possible realisations searched, 8143 have 
feasible solutions of T. 

To demonstrate how critical the "correct" choice of realisation, Table[9]shows three possible combinations. 
The first considers the first realisation found in the search (i.e. an arbitrary choice). The second considers 
the minimisation of the sum of the absolute values of the observer poles (in theory, the fastest observer). 
The third considers the metric of Proposition [2] 

Table 9: Realisations of B747 controller 



Realisation 1 



Realisation 2 



Realisation 3 



Min sum abs poles Proposition [2] 

10.6207 13.2767 
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2875 598.5 
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Figure [T6]graphically presents the open-loop pole distribution, and the closed-loop observer and state 



feedback pole allocations graphically for each realisation. Figure 17 shows the observer performance for 
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Figure 16: Observer-based realisations for Boeing 747 controller — Pole maps 



estimation of sideslip (/3, unmeasured), roll angle (measured), yaw angle (measured) and engine power 
for one engine (unmeasured) in response to unmeasured turbulence, and Gaussian measurement noise. 



with the standard deviations shown in Table qO] for 60 s of nominally "straight-and-level" flight, under the 
control of the discretised baseline controller. It can be seen that Realisation 2 (supposedly maximising the 
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Figure 17: Observer performance with different realisations (straight and level flight with turbulence and 
sensor noise) 



"speed" of the observer) is actually worse than the arbitrary choice in Realisation 1 . However, with both of 
these realisations, the measurement noise and disturbances are amplified to the the point that the state 
estimate is meaningless, rendering the MPC controller no more useful than a rather baroque realisation 
of Ko(z). On the other hand. Realisation 3 provides a very good filtered estimate of the measured states, 
and clear tracking of the salient features of the unmeasured state trajectories. 



Table 10: Measurement noise 3a values (units as per states) 
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10.3.3 Controller performance 

Unconstrained matching in the tracking a piecewise linear trajectory of yaw, height and true airspeed 
setpoints using cost function ( |32j ) and Realisation 3 is shown in Figure [18} It is of no surprise that the 



reverse-engineered controller matches the behaviour of the simplified baseline controller — the random 
number seeds used for sensor noise and turbulence are identical for each simulation, and the controllers 
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Figure 18: Tracking trajectories 



should behave identically. Figure [T8]also shows the closed-loop trajectory when all ailerons, spoilers 1 
and 4-8, left inboard and right outboard elevators are locked at their trim positions. The baseline controller 
is unable to perform the yaw manoeuvre, although even this is robust enough to track height and \/tas 
well, due to integral action. However, it can be seen that despite the removal of these degrees of control 
freedom, that the reverse-engineered controller, given knowledge of the constraints and a prediction 
horizon /V = 1 0, still has access to sufficient control authority to approximately track the required trajectory, 
although there is some loss of performance. 



If cost function ) [29| were to have been used, the trajectory would have been very oscillatory and failed 
to track the trajectory. This is clear from the interpretation of the cost function. When faced with control 
surfaces constrained to zero, ([29]| will attempt to regulate the plant state to a subspace where the relevant 
elements of u = KcX are zero. For example, when performing a change in heading, it is natural for the 
aircraft to perform a roll, but cost ([29) will try to counteract this due to the locked aileron. On the other 
hand, ([32) will use other degrees of freedom (i.e. the spoiler panels) to as best as possible effect the 
"intention" of t/ = KcX even if the realisation is rather different. 



11 Conclusions 



A method has been presented for obtaining a constrained MPC controller that behaves in the same way 
as an existing LTI output-feedback controller when constraints are not active. The method accounts 
for offset-free output tracking and can be implemented using standard tools. A new heuristic has been 
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presented for choosing the non-unique observer-based realisation upon which the IVIPC controller is based. 
Three examples of different complexity are presented to demonstrate the efficacy and usefulness of the 
scheme. Two of these apply to an approximately linear operating region of a non-linear plant. The resulting 
MPC controllers are able to enforce constraints, and use disturbance estimation and plant redundancy to 
provide a level of reconfigurability and robustness to plant failures, as would befit a constrained predictive 
controller designed from scratch. 
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